Probability axioms

In probability theory, the probability $P$ of an event $E$, denoted $P(E)$, is usually defined so that $P$ satisfies the Kolmogorov axioms, named after Andrey Kolmogorov and described below.

These assumptions can be summarised as: Let (Ω, F, P) be a measure space with P(Ω)=1. Then (Ω, F, P) is a probability space, with sample space Ω, event space F and probability measure P.

An alternative approach to formalising probability, favoured by some Bayesians, is given by Cox's theorem.


First axiom

The probability of an event is a non-negative real number:

$P(E)\in\mathbb{R} \land P(E)\geq 0 \qquad \forall E\in F$

where $F$ is the event space and $E$ is any event in $F$. In particular, $P(E)$ is always finite, in contrast with more general measure theory.


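Second axiom

This is the assumption of unit measure: the probability of the entire sample space is 1, that is, some outcome in the sample space is certain to occur:

$P(\Omega) = 1.$

Third axiom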

This is the assumption of σ-additivity:

Any countable sequence of pairwise disjoint (synonymous with mutually exclusive) events $E_1, E_2, \ldots$ satisfies

$P(E_1 \cup E_2 \cup \cdots) = \sum_{i=1}^\infty P(E_i).$

Some authors consider merely finitely additive probability spaces, in which case one just needs an algebra of sets, rather than a σ-algebra.
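
On a finite sample space the three axioms can be checked mechanically. The following is a minimal sketch, assuming the event space is the full power set and that probabilities are given as exact point weights. The names `powerset`, `make_measure` and `satisfies_axioms`, and the fair-die example, are illustrative and do not come from any library. On a finite space a sequence of pairwise disjoint events has only finitely many non-empty members, so σ-additivity reduces to finite additivity, and checking additivity for pairs of disjoint events is enough by induction.

```python
from fractions import Fraction
from itertools import chain, combinations

def powerset(omega):
    """All subsets of a finite sample space, as frozensets."""
    items = list(omega)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))]

def make_measure(weights):
    """Return P with P(E) = sum of the point weights of the outcomes in E."""
    def P(event):
        return sum((weights[w] for w in event), Fraction(0))
    return P

def satisfies_axioms(omega, P):
    """Check the three axioms on the power set of a finite omega."""
    events = powerset(omega)
    # First axiom: every event has non-negative probability.
    non_negative = all(P(E) >= 0 for E in events)
    # Second axiom: the whole sample space has probability 1.
    unit_measure = P(frozenset(omega)) == 1
    # Third axiom (finite case): additivity over pairs of disjoint events.
    additive = all(P(A | B) == P(A) + P(B)
                   for A in events for B in events if not (A & B))
    return non_negative and unit_measure and additive

# Hypothetical example: a fair six-sided die.
die = frozenset(range(1, 7))
P_die = make_measure({w: Fraction(1, 6) for w in die})
print(satisfies_axioms(die, P_die))  # True
```

Using `Fraction` rather than floating point keeps the equality tests exact. Weights that do not sum to 1 make the check fail on the second axiom.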

Consequences

From the Kolmogorov axioms, one can deduce other useful rules for calculating probabilities.

Monotonicity

$P(A)\leq P(B)\quad \text{if}\quad A\subseteq B.$

The probability of the empty set

$P(\emptyset)=0.$

The numeric bound

It immediately follows from the monotonicity property that

$0\leq P(E)\leq 1\qquad \text{for all } E\in F.$
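
The step can be spelled out as follows, assuming the unit-measure condition $P(\Omega)=1$ from the definition of a probability space. Every event $E\in F$ is a subset of $\Omega$, so the first axiom gives the lower bound and monotonicity with $A=E$, $B=\Omega$ gives the upper bound:

$0\leq P(E)\leq P(\Omega)=1.$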

Proofs

The proofs of these properties are both interesting and insightful. They illustrate the power of the third axiom and its interaction with the other two axioms. Many deep consequences of axiomatic probability theory follow from these three axioms alone.

To verify the monotonicity property, let $A\subseteq B$ and set $E_1=A$, $E_2=B\backslash A$, and $E_i=\emptyset$ for $i\geq 3$. The sets $E_i$ are pairwise disjoint and $E_1\cup E_2\cup\cdots=B$. Hence, we obtain from the third axiom that

$P(A)+P(B\backslash A)+\sum_{i=3}^\infty P(\emptyset)=P(B).$

Since the left-hand side of this equation is a series of non-negative numbers that converges to $P(B)$, which is finite, we obtain both $P(A)\leq P(B)$ and $P(\emptyset)=0$. The second claim is seen by contradiction: if $P(\emptyset)=a$, then the left-hand side is not less than

$\sum_{i=3}^\infty P(E_i)=\sum_{i=3}^\infty P(\emptyset)=\sum_{i=3}^\infty a = \begin{cases} 0 & \text{if } a=0, \\ \infty & \text{if } a>0. \end{cases}$

If $a>0$ then we obtain a contradiction, because the sum does not exceed $P(B)$, which is finite. Thus $a=0$. As a byproduct of the proof of monotonicity, we have therefore shown that $P(\emptyset)=0$.
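
The decomposition used in the proof can be checked numerically on a small example. This is only a sketch under stated assumptions: the loaded four-sided die and its weights are hypothetical, the event space is the full power set, and `P` is defined as a sum of point weights, so `P` of the empty set is zero by construction here rather than by the argument above.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical example: a loaded four-sided die with exact weights summing to 1.
weights = {1: Fraction(1, 2), 2: Fraction(1, 4), 3: Fraction(1, 8), 4: Fraction(1, 8)}

def P(event):
    """Probability of an event as the sum of its outcomes' weights."""
    return sum((weights[w] for w in event), Fraction(0))

omega = frozenset(weights)
# Every subset of omega, i.e. the power set used as the event space.
events = [frozenset(c) for r in range(len(omega) + 1)
          for c in combinations(sorted(omega), r)]

# The decomposition from the proof: B is the disjoint union of A and B \ A.
decomposition_holds = all(P(A) + P(B - A) == P(B)
                          for B in events for A in events if A <= B)
# Monotonicity follows because P(B \ A) is non-negative.
monotone = all(P(A) <= P(B) for B in events for A in events if A <= B)

print(decomposition_holds, monotone, P(frozenset()) == 0)  # True True True
```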

More consequences

Another important property is:

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$

This is called the addition law of probability, or the sum rule. That is, the probability that A or B will happen is the sum of the probabilities that A will happen and that B will happen, minus the probability that both A and B will happen. This can be extended to the inclusion-exclusion principle.
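
As a worked instance of the addition law, consider one roll of a fair six-sided die. The events `A` and `B` and the uniform measure `P` below are illustrative assumptions, not part of the axioms themselves.

```python
from fractions import Fraction

# Hypothetical example: one roll of a fair six-sided die.
omega = frozenset(range(1, 7))

def P(event):
    """Uniform probability: favourable outcomes over total outcomes."""
    return Fraction(len(event), len(omega))

A = frozenset({2, 4, 6})   # the roll is even
B = frozenset({4, 5, 6})   # the roll is greater than 3

lhs = P(A | B)                  # P(A ∪ B), with A ∪ B = {2, 4, 5, 6}
rhs = P(A) + P(B) - P(A & B)    # P(A) + P(B) - P(A ∩ B), with A ∩ B = {4, 6}
print(lhs, rhs)  # 2/3 2/3
```

Subtracting $P(A \cap B)$ corrects for the outcomes 4 and 6, which would otherwise be counted once in $P(A)$ and again in $P(B)$.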

$P(\Omega\setminus E) = 1 - P(E)$

That is, the probability that any event will not happen is 1 minus the probability that it will.
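
A short derivation of the complement rule from the axioms: the events $E$ and $\Omega\setminus E$ are disjoint and their union is $\Omega$. Applying the third axiom with $E_1=E$, $E_2=\Omega\setminus E$ and $E_i=\emptyset$ for $i\geq 3$, together with $P(\emptyset)=0$ and $P(\Omega)=1$, gives

$P(E) + P(\Omega\setminus E) = P(E \cup (\Omega\setminus E)) = P(\Omega) = 1,$

and subtracting $P(E)$ from both sides yields the stated rule.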
